Skip to main content

Full text of "An inequality for correlated measurable functions"

See other formats



Abstract. A classical inequality, which is known for families of monotone functions, is generalized 
to a larger class of families of measurable functions. Moreover we characterize all the families of 
functions for which the equality holds. We give two applications of this result, one of them to a 
problem arising from probability theory. 

1. Introduction 

The aim of this paper is to generahze an inequaUty, originally due to Chebyshev and then 
rediscovered by Stein in [3]. Usually this result is stated for monotonic real functions: the classical 
inequality is 

where / and g are monotonic (in the same sense) real functions (see for instance ^ and ^ for a 
more general version). If a = 6 — 1 then this inequality has a probabilistic interpretation, namely 
E[/g'] — E[/]E[(/] > (where E denotes the expectation), that is, the covariance of / and g is 

Our approach allows us to prove the inequality for functions defined on a general measurable 
space, hence we go beyond the usual ordered set R. More precisely, we prove an analogous result 
for general families of measurable functions that we call correlated functions (see Definition 12.11 for 
details). In particular we characterize all the families of functions for which the equality holds. 

Here is the outline of the paper. In Section [5] we introduce the terminology and the main tools 
needed in the sequel. In particular Sections 12. II and are devoted to the construction of an order 
relation and a c-algebra on a particular quotient space. In Section [3] we state and prove our main 
result (Theorem 13. 1|) which involves k correlated functions; the special case k = 2 requires weaker 
assumptions (see also Remark 13. ip . We give two applications of this inequality in Section H) the 
first one involves a particular class of power series, while the second one comes from probability 

We start from a very general setting. Let us consider a set X, a partially ordered space (y, >y) 
and a family M = {/jjier (where F is an arbitrary set) of functions in . We consider the 
equivalence relation on X 

and we denote by X/^ the quotient space, by [x] the equivalence class of x E X and by tt the 
natural projection of X onto X/^. Roughly speaking, by means of this procedure, we identify 
points in X which are not separated by the family TV. 

2. Preliminaries and basic constructions 


h{x)=fi{y), Vier 

2000 Mathematics Subject Classification. 26D15, 28A25. 

Key words and phrases, integral inequalities, measure, cartesian product, ordered set. 

To the family M corresponds a natural counterpart A/"^ = {</)/. jjgr of functions in where, 
by definition, i;^>/([x]) := /(x), for all x G X and for every / G satisfying 

(2.1) Vx,yGX:x~y^/(x) = /(y) 

(this holds in particular for all the functions in TV). It is clear that the family 7V^ separates the 
points of X/ r^. 

Given any function g defined on X/^ we denote by Hg the function g o n; observe that (pT^^ = g for 
all g G and tt,^^ = / for every / satisfying equation ()2.ip . Clearly g i— > vr^ is a bijection from 

y^/~ onto the subset of function in Y-^ satisfying equation (j2.ip . 

Note that given /, fi G Y^ which satisfy equation ()2.ip (resp. g, g\ G Y^l-) then / >y /i 
(resp. g>Y gi) implies (pf >(pfi (resp. -Kg > vr^J. 

2.1. Induced order. In order to prove Theorem 13. II we cannot take advantage, as in the classical 
formulation, of an order relation on the set X. Under some reasonable assumptions (se Definition l2.1l 
below) we can transfer the order relation from Y to X/r^ where we already defined a family Af^ 
related to the original Af. This will be enough for our purposes. 

Definition 2.1. The functions in M are correlated if, for all i £T and x,y € X, 

(2.2) Mx) >Y fi{y) =^ fj{x) >Y /,(y), Vj G r. 

We note that the definition above can be equivalently stated as follows: for all i,j G T and 

X G X, 


Besides, if y = M with its natural order, then the functions in M are correlated if and only if for 
all i,j G r and x,y G X, 

(2.3) {m-m){Mx)-My))>0. 

In particular if X is a totally ordered set and all the functions in J\f are nondecreasing (or nonin- 
creasing) then they are correlated. 

A family of correlated functions induces a natural order relation on the quotient space X/^. 

Lemma 2.1. If the functions in Af are correlated then the relation on X/^ 

[x] >^ [y] ^ fi{x) >Y f^{y), Vi G r 

is a partial order. If {Y, >y) is a totally ordered space then the same holds for (X/^, >^). Moreover 
Af^ is a family of nondecreasing functions (hence they are correlated). 

Proof. It is straightforward to show that >^ is a well-defined partial order (clearly it does not 
depend on the choice of x (and y) within an equivalence class). We prove that, if >y is a total 
order, the same holds for >^. Indeed if [x] / [y] then there exists i G T such that fi{x) 7^ fi{y); 
suppose that fi{x) > fi{y) then, by equation (|2.2|) . [x] >^ [y]. It is trivial to prove that (pf- is 
nondecreasing for every i G F, whence they are correlated since the space (X/^,>^) is totally 
ordered. □ 

A subset I of an ordered set, say Y, is called an interval if and only if for all x,y & I and z gY 
then X >Y z >y y implies z G I. Note that given an interval / C y then (l)J^{I) is an interval of 
X/^ for every i G F. 

Given x,y G X such that [x] >^ [y] we define the interval [[y],[x]) := {[z] G X/^ : [y] < 
[z] < [x]}; the intervals [[y],[x]], ([y], [x]] and ([y], [x]) are defined analogously. In particular for 
any x G X, we denote by [[x],+oo) and (— cxd, [x]] the intervals {[y] G X/^ : [y] >^ [x]} and 
{[y] G X/^ : [x] >^ [y]} respectively. 

2.2. Induced c-algebra and measure. This construction can be carried on under general as- 
sumptions. Let us consider a measurable space with a positive measure {X,T,x,n) and an equiva- 
lence relation ~ on X such that for all x € X and A G Sx, 

(2.4) X e A^[x]CA. 

There is a natural way to construct a cr-algebra on X/^, namely define 

:= {7r{A) : A G Sx} 

where 7r{A) '■= {[x] : x G A}. This is the largest u-algebra on X/^ such that the projection map vr 
is measurable. Observe that A t^^A) is a bijection from T,x onto S^. It is natural to define a 
measure fl := fi-j^ by 

nHA)) = fi{A), yA G Sx. 

It is well known that a function g : X/^ ^ M is measurable if and only if vr^ is measurable. 
Moreover g is integrable (with respect to Jl) if and only if vr^ is integrable (with respect to /i) and 

(2.5) / TTgdfj, = / gdJI. 

Jx Jx/^ 

We say that a function g is integrable if at least one of the integrals of the two nonnegative functions 
g~^ := max{g, 0) and g~ := — mm{g, 0) is finite; hence the integral of g can be unambiguously 
defined as the difference of the two integrals (where =boo + z := ±00 for all 2: G M and • ±00 := 0). 
This notion is sligthly weaker than the usual one: to remark the difference, when the integrals of 
g~^ and g~ are both finite the function g is called summable. 

It is a simple exercise to check that the equivalence relation defined in Section 12.11 satisfies 
equation (j2.4p if Sx = <7(/i : ? G F) (that is, T,x is the minimal u-algebra such that all the 
functions in J\f are measurable); this equivalence relation along with its induced cr-algebra and 
measure will play a key role in the next section. 

Remark 2.1. It is easy to show that ii h,r : X are two integrable functions such that the sum 
f-^ hdfj, + f-^ rdfj, is not ambiguous (i.e. it is not true that hd^ = ±00 and f-^ rd^ = ^00) then 
h-\- r is integrable and 

(2.6) / {h + r)dn= / M/u + / rd/i 

Jx Jx Jx 

(both sides possibly being equal to ±00). This will be useful in the proof of Lemma 13.31 

3. Main result 

Throughout this section we consider a measurable space with finite positive measure {X, T,x , fJ-) 
and a family of correlated functions M = {/ijier, where Ex = o"(/j : i G F). Let us consider 
y = M with its natural order >. The equivalence relation ~, the (total) order >^ and the space 
(X/^,S^,7l) are introduced according to Sections 12.11 and 12.21 It is clear that contains the 
cr-algebra generated by the set of intervals {(l)J^{I) : i G F, / C M is an interval}. More precisely 
it is easy to see that, by construction, all the intervals of the totally ordered set {X/^, >^) are 
measurable since A/"^ separates points. 

The main result is the following. 

Theorem 3.1. Let ^{X) < +00. 

(1) If f , g are two integrable, /i-a. e. correlated functions such that fg is integrable then 

(3.1) / fgdfi > [ /d/x / gdfi. 

Jx Jx Jx 

Moreover, if f , g are summable, then in the previous equation the equality holds if and only 
if at least one of the functions is fi-a.e constant. 
(2) If {fi}i=i be a family of measurable functions on X which are nonnegative and fi-a.e. cor- 
related then 

(3.2) Kxf-^ n/*d/.>n/ /.d^- 

Jx -^j^ -^^^ Jx 

Moreover if fi^l^ ^ (0, +oo) for alii = 1, . . . ,k, then in the previous equation the equality 
holds if and only if at least k — 1 functions are fi-a.e. constant. 

Before proving this theorem, let us warm up with the following lemma; though it will not be 
used in the proof of Theorem 13.11 nevertheless it sheds some light on the next step. 

Lemma 3.2. Let M := {{a^i(i)}jeN}j=i be a family of nonnegative and nondecreasing sequences 
and {/UjjjgN be i family of strictly positive real numbers. IfYlil^i < then 

(3.3) y^i^i) n ^i^3)iJ-i > n ^i(.?')/^»- 

i i j=l j=l i 

Moreover if for every j we have < Yli^iU) < +°*-^ then the equality holds if and only if at least 
k — 1 sequences are constant. 

Proof. We prove the first part of the claim for two finite sequences {xj}"^^ and since the 

general case follows easily by induction on k and using the Monotone Convergence Theorem as n 
tends to infinity. 

It is easy to prove that 

n n n n 

(3.4) ^/Uj^Xiyi/ij-^Xi/ii^yi/Xi = ^ {xi-Xj){yi-yj)^ifij = ^ {xi-Xj){yi-yj)^i^ij. 

i=l i=l i=l i=l ij'-i^j ij'-i>j 


71 n n 

^ /ii ^ Xiyifii = ^ {xiyt + Xjyj)^ij2j + ^ Xiyifi^ 

i=l i=l i;j'-i>j *=1 


n n n 

^ Xi^^i ^ yifii = ^ {xiyj + Xjyi)niHj + ^ Xiynxf. 
This implies easily that 

n n n n 

^ /Xj ^ Xiyiiii - ^ Xiiii ^ yj/Xj > 0. 

i=l i=l 1=1 i=l 

If either at least k — 1 sequences are constant or one sequence is equal to 0, then we have an 
equality. The same is true if Xj(j)/ij = +oo for some j and J2i XiU)f^i > for all j, since both 
sides of equation (I3.3p are equal to +oo. On the other hand by using the first part of the theorem 

and by taking the limit in equation ()3.4p as n tends to infinity, for all 1 < ii < ^2 ^ 

i j=l j=l i 


>(y^^^i)^xi{il)xi{i2)^li J| ^xi{i)ixi-\Y^xi{i)iii 

j¥=ji,h « i,ii:i>ii 

If both {xi{ji)}i and {xi{j2)}i are nonconstant then there exist r < / and ri < li such that Xriji) < 
xi{ji) and Xri{j2) < xi^{j2). This implies 

in) (ii) > and Xmax(i,/i)02) - 

^min(r,ri)(j2) > 0, thus the right hand side of equation (|3.5|) is strictly positive (just consider 
the summation over : i > max(/,Zi),ii < min(r, ri)}) and we have a strict inequality in 

equation (j3.3p . □ 

The proof of the previous lemma clearly suggests a second lemma which will be needed in the 
proof of Theorem 13.11 

Lemma 3.3. Let Af := {/, g} where f,g:X^M are two summable functions such that fg is 
integrable (for instance if f and g are fi-a.e. correlated). If ^{X) < +00 then 

K^) / f{x)9{x)dfi{x) = / f{x)dfx{x) / g{x)dfx{x) 


+ 1^ / {fix) - fiy))i9ix) - 5(y))d/i(x)d/i(2/). 

Proof. Note that 

(3.7) fix)gix) + f{y)g{y) = fix)g{y) + f{y)g{x) + (/(x) - fiy))igix) - g{y))- 

where fix)g{y) and fiy)gix) are summable on X x X, since /, 5 are summable. If we define 
hix.y) := fix)g{y) + f{y)gix) and r{x,y) := (/(x) - f{y)){g{x) - g{y)) then, according to Re- 
mark im we just need to prove that h and r are integrable (since /i + r is integrable by hypothesis). 

If /, g are summable then, by equation (|3.7|) . fg is integrable if and only if (/(x) — f{y)){g{x) — 
giy)) is integrable on X x X (since the sum of an summable function and an integrable function is 
an integrable function) and equation (|3.6|) follows. Clearly if / and g are correlated then {fix) — 
f{y)){g{x) — g{y)) is nonnegative thus integrable. 


Proof of Theorem 

(1) By equation (|2.5p it is enough to prove that 

/ (pfCpgdjL > / (pfdjl + / (t>gdjl. 

Jx/^ Jx/^ Jx/^ 

If / and g are summable then the claim follows from equation (j3.6p of Lemma 13.31 Oth- 
erwise, without loss of generality, we may suppose that ^/d^^ = fdfi = +00. If 

J^^ <j)gdj[ = gdfi < then there is nothing to prove. If f-^ gdfj, > then either g = 
//-a.e. , in this case both sides of equation (13. ip are equal to 0, or there exists x £ X/^ 
such that 7l([x,+oo)) > and 4>f,4>g > on [x,+oo) (since and (pg are nondecreasing) . 

Clearly J[^^_^_^)4)fdn = +00 and 4>f{y)4>g{y) > 4>fiy)(l>gix) for all y G [a;, +00), hence both 
sides of equation (j3.ip are equal to +00. 

If one of the two functions is constant then the equality holds. If / and g are nonconstant 
(that is, (j)f and (pg are nonconstant) then there exists xo,yo ^ such that xq >~ yo, 
(j)f{xo) > 0/(yo), 4>g{xo) > (pgiyo), 'Pi{-oo,yo]) > and 7Z([xo,+oo)) > (this can be done 
as in Lemma l3.3p . Hence, using equation (j3.6p . we have that, 

> ( / ihi^) - My))iM^) - My)Wi^W{y)) 

^ i[xo,+oo)x(-oo,yo] 

>7l((-oo,2/o])7x([xo,+oo))((?:)/(2;o) - cl)f{yo)){cl)g{xo) - (t)g{yo)) > 0. 
(2) Let us suppose that fi is summable for alH = 1, . . . , A;. It is enough to prove that 

JX/^ i=l i=i J^l- 

In the previous part of the theorem, we proved the claim for two functions and (^g\ as 
in Lemma |3.'2| the general case follows by induction on k. 

If at least two functions are nonconstant, say ^^^^ then as before we may find 
2^0, yo e Xj^ such that xq >^ yo, 4'fi{xo) > 4'fiiyo), 4>f2{xo) > (^/aCyo), 7^((-oo,yo]) > 
and 7^([xo, +00)) > (this can be done as in Lemma [3^3]) . By applying the first part of the 
claim to the family (of k — 1 functions) (j)fj^(j)f^, (pj^, . . . , (f)f^ (which are clearly still correlated 
since they are nondecreasing) and using equation (j3.6p we have that. 

j^/^ i=i i=i j^i-^ 


= U{X/^)l (l)f^(j)f^d]I - (Phd-p- ^/ad/i) n / ^f^^'P 
^ Jx/^ Jx/^ Jx/^ ' Jxj^ 


(0/i(a;) - (t>fi{y)){4>f2(.x) - '/'/2(y))dA^(a;)d^(y)j / 0/^d/i 
>7l((-oo,yo])7l([xo,+oo))((/)/,(a;o) (yo))(0/2(2;o) -0/2(yo))n / ^fr^P > ^ 

i=3 -^^/^ 

since < j-^^ cpjAjiK +00 for all i = 1, ... , k, thus the second part of the claim is proved. 


Note that if fidfi = +00 for some i and fjdfi > for all j (otherwise both sides of 
equation ()3.2p are equal to 0) then both sides of equation ()3.2p are equal to +00; indeed apply 
the first part of the theorem to the family of correlated bounded functions {mm{ fi,n)}^^i (where 
n € N) and take the limit of both sides of equation (13. 2p as n tends to +00. 

Remark 3.1. According to Theorem 13. H there is a difference between the case k = 2 and k > 2; 
indeed in the latter case the inequality cannot be proved for integrable (or even summable) //- 
a.e. correlated functions which are not nonnegative. Something happens in the inductive process, 
namely if are correlated this may not be true for {/i/2,/3, • • • , fk} (if the functions are 

not positive). Here is a counterexample: take X = [—1,1] endowed with the Lebesgue measure, 
fi{x) = f2{x) ■■= xl[_ifl-\{x) and fi{x) := x - fi{x) for all i > 3. 

Strictly speaking. Theorem 13 . 1 1 could be proved without the constructions of Sections l2 . 1 1 and [2^21 
one has just to use carefully equation ()2.3p and Lemma [3^ Our approach simplifies the proof of 
Theorem 13.11 and gives a better understanding of the role of the correlation hypothesis (compared 
to the usual monotonicity) . 

We finally observe that if we consider two integrable anticorrelated functions (meaning that 
(/(x) — f{y)){g{x) — g{y)) < for all x,y e X) such that fg is integrable then, clearly, we have 

4. Final remarks and examples 

Let us apply Theorem [3] to a class of power series. We consider f{z) := Ylin=i ^nz"" where 
{an}n is a sequence of nonnegative real numbers and we suppose that {p"an} is nonincreasing 
(resp. nondecreasing) for some p such that < p < R (where R is the radius of convergence). Then 
the function z i— > (p — z)f{z) is a nonincreasing (resp. nondecreasing) on [0,p). 

Indeed if we suppose that {p^a^} is nonincreasing then, for all 2,7 such that < z < 7 < p, we 


5^a„z" = ^a„p"(z/7)"(7/p)" 



E+00 n +°° 



where, in the first inequality, we applied Theorem 13.11 to the (correlated) functions /i(n) := an/o" 
and /2(n) := [zj^Y" defined on N endowed with the measure := X]nGA(7//')"- '^'^^ when 
{p"a„} is nondecreasing is analogous (observe that now the functions f\ and are anticorrelated). 
\i z < p < R then f\ and are nonconstant functions, hence the function 2; 1-^ (p — z)f{z) is 
strictly monotone. 

We draw our second application application from probability theory. To emphasize this, we 
denote the measure space by {i},J-,¥) and we speak of random variables and events instead of 
measurable functions and measurable sets respectively. We note that if A; = 2 then Theorem 13.11 
says that correlated variables have nonnegative covariance that is, E[/i/2] — E[/i]E[/2] > (where 
E[/] := /dP is the usual expectation). 

We call the (real) random variables {Xq , Xi , . . . , Xf^. } independent if and only if, for every family 
of Borel sets {^0, ^1, • • • , ^fc}, we have P(nf^o{Xi e ylj) = HLo ^i^i ^ where ¥{Xi € Ai) is 
shorthand for F{{uj e ft : Xi{uj) e Ai}). 

In order to make a specific example, let us think of the variable Xi (i = 1, . . . ,k) as the (random) 
time made by the i-th contestant in an individual time trial bicycle race and let Xq be our own 
(random) time; we suppose that each contestant is unaware of the results of the others (this is the 
independence hypothesis). If we know the probability of winning a one-to-one race against each of 
our competitors we may be interested, for instance, in estimating the probability of winning the 

race. Such estimates are possible as a consequence of Theorem I3.1| indeed we have that 


'intl{x^ > Xo}) > HnXi > Xo) 


p(nti{Xi < Xo}) > WnXi < Xo). 


Thus the events {{Xi > Xo}}f=i (resp. {{Xi < Xo}}f^ i) are positively correlated (roughly speaking 
this means that knowing that {Xi > Xq} makes, for instance, the event {X2 > Xo} more likely 
than before). 

The proof of these inequalities is straightforward. If we define ^{A) := ¥{Xo € A) for all Borel 
sets ^ C M, then, according to Fubini's Theorem, 


>Xo)= [ F{Xi > t)dfi{t), n^tiiXi > Xo}) = [ f\nXi > t)d^(t) 

JR JR ~^ 


p(x, < Xo) = / P(x, < t)dfi{t), P(nii{Xi < Xo}) = / TTP(Xi < t)dfi{t). 

JR JR,, 



F{Xi > Xq) = [ di^{s)dn{t) =11 dv{s)dii{t) = [ F{Xi > t)dfi{t) 

J{{s,t)eR'^:s>t} JRJ[t,+OD) JR 

where i^{A) := P(Xj £ A) for all borel sets ^ C R and the first equality holds since Xj and Xo are 
independent. The remaining cases are analogous. Note that {P(Xj > t)}^^^ and {P(Xj < t)}f=i are 
both families of monotone (thus correlated) functions; Theorem 13.11 vields the claim. This example 
can be easily extended to a more interesting case: namely when {Xi, . . . ,Xfc} have identical laws 
and are independent conditioned to Xo (see Chapters 4 and 6 of [1] for details). In this case one 
can prove that 


P(n^=i{X, G A}) > ]JP(Xi eA), C E Borel set. 


The proof makes use of Theorem 13.11 in its full generality but this example exceeds the purpose of 
this paper. 

The author thanks S. Mortola for useful discussions. 


[1] P. Billingsley, Probability and measure, Wiley Series in Probability and Mathematical Statistics, John Wiley & 
Sons, New York, 1995. 

[2] R.A. Brualdi, Mathematical Notes: Comments and Complements, Amer. Math. Monthly 84, (1977), n. 10, 

[3] S.K. Stein, An inequality in two monotonic functions, Amer. Math. Monthly 83, (1976), n. 6, 469-471. 


MiLANO, Italy. 

E-mail address: 
URL: http : //wwwl .mate .polimi . it/~zucca